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Abstract 



The present invention relates to a polyester synthase gene coding for a polypeptide containing the 
annino acid sequence of SEQ 10 NO:2 or a sequence where in said amino acid sequence, one or more 
amino acids are deleted, replaced or added, said polypeptide bringing about polyester synthase activity; 
a gene expression cassette comprising the polyester synthase gene and either of open reading frames 
located upstream and downstream of said gene; a recombinant vector comprising the gene expression 
cassette; a transformant transformed with the recombinant vector; and a process for producing 
polyester by culturing the transformant in a medium and recovering polyester from the resulting culture. 
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u < \mm(or % ymti^x^. a^s o < ^ n/c 

[iftScJI 7 ] ^-') xX7^;u«^S^«e^OT»Cc?? 
tiS^JI 8 ] n^m 1 « 0 < « 2 iaiS(?)5i< U x x-r ;u 

^i^a i -r 3i< X X f-;ucDMiS:^rio 
R 

I 

HO-CH-CHz-COOH 

{n^mi 2) ;i<';xxf-;u?!^5. O-t Fn + 
i^y^U-- h-3-fc: Fa=^t/^=Mf>^x-h) 
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CO GO 1 1 

tj xx7";KoS5!ii:firffi&csiT^o 

[0 00 2] 

10 ->^^U-F ( PC3HB) ) ^^-^fiSb. x^;W^'-cDffi^ 

€>o ?S^!B3(*rt3&^6}tttHL/c PC3HB) 180"CgStC 

:7*^x5^-;;^<i:Lraa5nri^^o ^fc poh 

[0 00 3 ]ia^. 3 -t FU^v'^/'^U-- h(3HB) <b 
3 - t Fp^^v'^+I^-^x— h (3HH) tOZJiSL^^M-^ 
;j^ijXX7^;l/ P(3HB-CO-3HH) ^J^ZJ^^OUB^i^C-Oi.^ 
X. fF^. ^^t^^Ji^tl. tct^^t. 5-93049^ 

l>^o Cn6(0i&fE© PC3HB^co-3HH) ^tS^fttDSf^ 
ffi^i. ±cJ: «3 #it U /cTX a^:f X • =^-^trx (Aeromo 
nas caviae) ^^CiT:t U >M^:t' J -":^:t >f /t'^&^e 
^^^T^^OrA-So P(3HB-co-3HH) 

30 itS'^^*^. 3hh:3.x V h ^momM tti> CCj^H^jb^ 
tc*>iinxr#^ci3&5?8e>*^tc$n"CiiS (y. doi. s. 

Kitamura. H. Abe, Macromolecules 28, 4822-4823 (1 
995)) o Lf)^LtS::fyS^^ !^||^5-93049#^#a*J<fcO'Jt$ 

ra^7-265065^<&iacciBis©s*^:^6fer». >; XX 

($£^a*4^^F*9CD^'Jxxf*;i/^Wa) 

fci^. P(3HB-CO-3HH) *tg^;J<ijXXf*;U^]SiR^-Ci 
40 [0 00 4] 

[0005] 

50 «il{E^icWrar^±8aRan=^SSO::d--:7'>U--r'/> 



3 

[0 0 0 7 ] $6^C. MiS^ xXr-^bm^ 

^\^<m-^m^tifcmn^^^. xy^;u-coAtK 

[0 0 0 8 ] CCr. ^^H^O^KUxXf-^US^^^fi 

[0 0 0 9 ] a B?tB3ii Uxxf-^I/S^ 

[0 0 1 0] $e»CC. ^I^H^ti. HulBff5®K^f**^tt6 
iCigSU. f#e>nS^^:^^6>i<';xX'r>H/^:ggaX-r€> 

c<b^iRF8i<hT^jJ^yxxf';i/0^:???£'C*S. 4o 
xx-rjuiorti, mti. : 

[0 0 1 1 ] 
Ut2] 

R 
I 

H0~CH-CH2-C00H 

[0012] (Rittykmm^x^tmm^ i - 4 (dt^u^^ 

3- t Kn=^^$^-Ns^tfyx- h) ^^^S^'Ai^S^t*) ibs 50 
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[0013] 

[^?BC7)||SgCDff^«gl 

CD ^yxxf-;um^l^jie^(Z>^n-x>^ 

*^iacr);i<yxxr>ri'm^i^^jie^ts. rxn^:^x 

*WT€»M*fc;Cr^6S^fef*D N A 4 

X (Aeromonas cavlae) ^^^Cf 6tl-5>o 
[0014] SSfel*DNACDi@Sti^ai©^ffi*fflC:»-5 

X'^Affi (Currnt Protocols in Malecular Biology,! 
2.4.3 H. John Wiley fiSons UlKg, 1994^) IfCC 

[0 0 15] ±iECD^Stc^»5f#6n/cDNA«:®^3:c 
*»M»* (ta|;t«Sau3Al. BanHI. Bqlll^) XSR^^M 

gin m X^mbtc-^^^-fy^y-iyBy^n 

[0016] ^^^-(>at. m^m^^xsmm^iC^m 

-^^^f-iUr^. m^^tEM&L3 . M13 . Agtn^^&S* 
Cf6tl. :7'^X5 K-^^^^-iLTt^. ^(^^ipBR322.. 
pUC18 . pBluescript II ( STRATAGENE?±S3 ) ^;&i^Cf 6 

[0 0 17] DNA»r)t<t^^^--»r>T-cb^afe$if 
CCt^. ^»]CDDNA y:^'— fe'^ffll^-So -^LT. DNA 

[0018] m^dk^^(fcmik^^'^ ^-^mxr^tfC 

AJlfflOit ^ iJJliy^Am ( Lede rberg , E , M . 
etal., J. Bacteriol. lis, 1072(1974)) ^XU^J? h Pjj< 
U — S^aVffi (Current Protocols in Ktolecular Biolo 
gy, 1^, 1.8.4 H, 1994^) ^J^ffl-r^C <!:3&5r#. 

^ — i^>^S (Current Protocols in MDlecular Biolo 
gy. 1«. 5.7.1 M. 1994^) l?*ffi«-r^Ci:d5-C* 

(Cigapack II : STRATAGENE ttSS^f ) ^fflli-S C i 

[00 19];^^. rxD-t:^^ • t:rxcojj<yxx 
r .'bfi'^B^iae^-^r^tf DNA Krit^WS /ci6(Z) 7* u 
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5 

tcoc^r^j. mc^mmti^oy^otmhtixy.^^ (peop 

les. O.P. and Sinskey, A.J., J .Biol .Chem. . 264, 15 
293 (1989) ; Huisman, G.W, et al . , 3 .Biol .Chem. ,26 

6, 2191 (1991) ; Pieper,U. et al . , FEMS Microbiol. L 
ett., 96, 73(1992)ffi) „ ^CC. Ctlh<Dr ^ ^WSi 

m<D'>^. «i?3nrt^^2o<7)^iiic^jiiRu. -en?: 

•-CC(C/G)CC(C/G)T0GATCAAa/OAAGT(T/A) (T/C)TACT 
/C)ATC-3' (i2?»JS-^7) . SC>'5'-(G/QAGCCA(C/OGC 10 

(G/C)CrCCA(A/G)TC(G/OGCCCACCA-3' (ge^'J#-^8) t? 

as n-s 2 a^c7):t n 5? ^ u :t ^ Fx^i^cf 6n^;&^c 

[0 0 2 0] CtXho:>ir')^y^^\y:^^Y^y''y 

>1< y >^ ^--ffaiSSJS (PGR; Molecular Clonin 
g, 2^. 14.2M, 1989^) ^tf^o ^hX. PCRCCcfc 

[0 02 1 ] ;^tC. C<7)gi5^iti|ii»f>t*®^^^^^*ffl 

— y^^:7' V -fe'— >'3>j|:tf^ (Curmt Protocols i 
n Molecular Biology, 1^, 6.0.3 M. 1994^) o 
[0 02 2 ] ncix-^yN^':;rt;^^-fe'^^3>(C<fcO:^ 
^U-x>^$n/cAPS/^)^6TJl^:^ Uffi (Currnt Pro 
tocols in Molecular Biology, 1^ .1.6.H , 1994^) 

[0 0 2 3 ] ±IBDNA»rK-CDiS»gem^^^^. 

^?(lx.«1^>:^-;ffi (Molecular Cloning, 2^, 13. 

3M, 1989^) ^(c<i:o-ctf^ci*i'c^. i&mmmn 30 

8&^WS?^. {^^^i373A- I>R >'-i^x>-!t- (Applie 
d Biosystemstt) ^J&fflC^rtf ^ C <h*^r^ -S. 
[0024] iB9»J#-^ 1 0C*:^eB<D5i< V xxf^US^W 

[0 0 2 5 ] ?^C*>\ ±f5X^CD^St^, ^*a<DlH5f4?^ 

fi^Si^^Wiii (Current Protocols in Molecular Bio 
logyl^, 8.1.1 M, 1994^) J: 0 ^ C i^^r 

^^CCjr^r. XiiS^feftDNA^^MiL'/cPCRS 
t^cJ^-oX. ^^i.H3:te±SSiS?iJ*Wr'E>DNA8fTM-^:7- 
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[0 0 2 6 ] (2) ^Sli^f^Of^iS 

Sffl^gii^-^ ^ ^ - ^f^is-r Steffi I i/c^^-^ ^ ^ - 

[0 0 2 7 ] TJU;^yy4^:^ii^cBi-r^ss*^^. 

^RK2lSM|3/j5;§rWr^plA2917(ATCC 37355) . 
t3:RSF101O)t^^a,#.^W"r'SpJRD215(ATCC 37533) ^i)^ 

CO 02 8] y'U^-'^-t{^X\t. ^^tfi-c^^-C^ 

^^(oxhti\ti.^'rti^m^^xhx\.K m^\t. trp :7* 

lac Pl P 

a ::?'n-t-d?-. T7^a^-a?-3^cC<!:(D::Acisa'^r^T'- 

•Y:**>*fflC^5:^rS (Current Protocols in Molecular 
Biology, 1^, 1.8. iS, 1994^) . XU^H3d<U- 
>'H>ffi (Current Protocols in Molecular Biology, 1 
^. 1.8.4 M, 1994^) ^J&^^Cf ^n-So 

[00 2 9] mm^^m^t iyxm\^^hm^\t. 

^-thX. «?!|^«YEpl3 . YCp50^:^iW6n-S>o 
P^-:$r-cbLr«. p!lx.{3:gal l^P^-t^?-. gal 
io:/n*-^-^:*s^Cf6n^o BS-v-offl^AftD N 
AO^A^'ffiiLr^i. t^xf^xu^ hn7i<u->^3> 
S (Methods. Enzymol . .194,182-187(1990)). Xr^jiP 
:7''5'::^ FS (Proc. Natl. Acad. Sci.USA. 84, 1929-1933(19 
78)). i^KU^-^AiS ( 3. Bacteriol.. 153, 163-168(198 

3)) mi^m^^hti^. 

[0 0 3 0 ] mmmm^m^t \^xm\^>^m^\t. 

^^^-i OrFd^JipcDNAI. pcDNAI/Amp (-YVfc^FP 

A(D^A:^iS <h orti. c^ijxt^:. xu^j; h u^i<u-i^3 

[0 0 3 1 ] ccr. BuiecOct^ccur^^^sn/c^s 
ie?u«. 3i<vxxf*^ua^#^3te^-cDt3t;&^cc. 

t>-^. ?j<';xX7-;UM'^P^aeiPiJ. m-cD-7*n-t- 
^-MilSc7)^iBTCCia>>:c< i^2jl(DORF ^^4>CC:^- 
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( 0 0 3 2 } <J:r.X7-)Vm^mm^Blf-(D±mi^iAm 
■r-SORF^rJilT rORF 1 J ilHV TiStCiia-T'S 
ORF%WTrORF3J tl^^, 0RF1«> 

^^mkm&^<Dii<Dt miotic, */c. orf3«> 

ir— b' (i^(C (R) -i|#SWx>'-r^U-Co At K7 3f 
Ote. 

[003 31 ^^m-cit. m I icmrj: ^mmm lo 

MJ« (01 (1) {cfcc^r r-35/-ioj <t*^) . ^Kvx 
:^f-jvm'^mmM&^. orf iso'orf s^^tfEc 

0RI»fJt*i'O-->i/Lfc (0 1 CD ) . CCD»r>t?: 
[0 03 4] ^X«C, EE32(c*sl,^rORF i:5^«ORF 

3©c»-rn*i-:^x«S:«r^^^^3-i*fcif>T- (ite^^. . 

CitciO. ^';xxf^l/>&^SJ:<^-r-5Ci*s-C 

[003 5] EE 32f . f631$l|ffll««<t0 F R 1 OMm 20 
g8>&MJS i ©ft! , SO- O F R 1 ©fSiR#±mi^ i U x 

$lMffiE^BqlII gPfi^r^AC. Bqlll CCi OR F 1 * 

^<i3fe3i±s (SI (2) ). ctitmmdcLx. sj^yx;;^ 

f-;t'a-^S^Se^OfSI»?^±M^<!: O R F 3 i (DTMiC 
mmmm&antiLmM^WA C . Bamm^ffiSC J: 0 O R F 3 
(01 C3) ) . 

[0036] ORF lRafORF3©M^*^«^^S-t±S 
EE32(cot,:.-C> ±iSORF IRD^ORF 3*^<; 
^3-lf5afP=&M:^tf^«J:l-» (01 C4) ) , Uii, m 30 

Sfi'J^M^fe (Curmt Protocols in Molecular Bioloqy, 
8.1.1 M, 1994¥) fca;or3?A-rSCi3!»it?* 

[0 03 7] C®J;'5(CLrf#P.n/c-€-ti^n©jteT- 

RD215 (ATCX 37533)) CCifAL. tfe.nfciffl^^'^d:' 
— ^fflt^r. TJUf jy^'-;^ • ^- hO^r;^ (Alcali 
genes eutrophus) • PHB-4 ^ (DSM541) (^KUx;^-?-;!/ 

[ 0 0 a 8 ] (3) x;^f-;u©«ji 

x;^ 7^;KD^(i. :$^Bj©JF5«ig»{*?:tSt6f Ig 
lg«a{tX«^Si^4itc*^B^©jJ<>;xj:?.r->'U* 
^Sas#. ^iSSa(*X«^t|i|a*»6S#'Jx^r- 

i^m-cmmr^ysim. m^©««K:ffl(,^e.nsam 
©:*rffi«:ee-5-ctf*Dtis. so 
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[0 03 9] Tju* y^^micmr^'^^^xitiy^ 

©'5^©l,»-rn*>**lJSgL//c^it6. F!l^«a*?S«:O.Ol 

c 0 0 4 0 ] igu^«ai^!^©i9JifCi£;>sr*'3 , 

o, *'Jxj^f-;U^):!£CDJls^i/j:S^b©-C**), ^©W 

X. •7;Uh-X^©^*{fc!Kj7!)iWi?>n5, 

115! 2 Ji(±©ttllMa^® <!;-r-5Ci4>-C#S. 
i^l^2«±©teflgKaf)®<i:Ort3:> 

u < {i 5 X 9^ >^©BgM^« c n 69116®© 

e> T 3 — ;KD X X 7^ JU^jJs^ tf 6 n „ 

[0 04 1 ] mmmt Lxii. m«r>*xr. ^{b 

^©T>-=ex^A*a©ffe. -^:/h>. ^x+x. SSx 

[0042] ig«», astgja^«^c <b-©$f me^j^fr 

T. 25~37-C-r^R^24K5fraJ':^± (PI:S.«1~7 
h^i^-r i' •;>^©ei^!8jS*^«&Ccastt]0-Ct cf:c\ 

-ei^r. igs-rsc<!:Kj:f)'i<yxxf-'i'*mrt«:s 

[0 04 3] il^tt©7*a*-3?-^fflt,^/c^31-<e7jr 

fjU-zS-D-^jf^'^i' hb--7^t'K (IPTG) > -{> 

[0 044] tt!MB»a=S:?gii L.-Cf#6nfcffJS^» 
^r^^-rSSiftiUrta:, filx.«RPMi-i640 > OitMlgJi 

x« c n 6 ©«*cc i^m^m^mua o tcmm^m <- » 

6n€>. 3i^5%CO.#^ET. 30~37*C-C14 

[004 5] :*:|«B^fc*5t,>-c. ^ U xx-7^;l'®l3S8«F!l 

A«WT©J: ^ Ktf ^> C i *5-C# SS«AiP)^'C^dg- 
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[0046] f#6ti/c!j<Ux;:^f ;U*5ae5j<Dfc(0'r*^ 

[0 04 7 ] 
[ft3] 

R 

I 

HO-CH-CH2-COOH 

[0 04 8] (R\tAmm^x^tmmw(i--^(DTJi^^ 

^y'7'-zL::Ly htLfci^m-^i^ (7f<';xxr-;i/) 

(3 - 1 Kn^!>r^^u- h - 3 -b Kn^^V"^ 

:^1fyX'-h) ^y^J^i^m^W (P(3HB-CO-3HH) ) ^ 

«p(3HB-co^3HH) imibximm-c^mrr^mtiirm 

[0 04 9]S£*r«. 3 -t KP+^:/^U- 

h ( P(3HB) ) ^^mt7i<V (3 -b Ka=^i/r^^^- 
^- 3 -fc Fn^i^>'>''; h) ^>^A^fi^» ( p 

C3HB~co-3HV0) OM^^SCCOC^rW^S. ig^:^^^C$n# 
[0 05 0] ^m^d(D3-}:z V' U^iy-^=^-^ y V 

fflti/cSe*(DPC3HB-co-3HH)aS{3Sft (iRFPg^5-93049^^i 
$S4b'J:O'1tia^7-265065#^>$R) rtS. ^ U x;:^7^>H/CD 

[0 05 1 ] CnCC*tL. *^H^rtiPC3HB-CO-3HH) ft 

*:^?^CDiteT-?rfflC^r P(3HB-co-3HH) ftS^jJ< U x;^ 
f- war S C 6 r 1 6 . 
[0052] 

iMMm 1 D Tx^*:^x • + bxcD^tf U x;^f-;ua 50 
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mimc^ rxa^::f-X • bxco^fefiDNA^-^:/ 

[0 05 3 ] rx.a^i-X • bxF A4 4 O^^io 
0ml (DLBi^m ( 1 % -f-::^ h X^X, 0.5 %h';:/h 
0.5 h y 't^A. 0.1%^;U3-X. pH7. 

5 ) 4^. 3o'cxm^^mLMk. mt^'^'^f't^j\^hv 

^ ^;UT>"^:x»^Affi (Curmt Protocols in Molecula 
r Biology. 1^. 2.4.3.M, 1994^; John Wiley & Sons 

[0 05 4] ^htitcm^i^DN A^mmmmsauSAix 

K'^^:$?-'C*SpLA2917(ATCC37355)^^SefflL/Co 

HUM (Molecular Cloning,!^, 5.7.2 M. 1989^; Col 
d Spring Harbar Laboratory UifiR) ^MLfc^. DN 

[0 055] C<DM^DNAmmmi>fc^y\^ha ' 
1/>^^ (Curmt Protocols in Molecular Bi 
oloqy,l#, 5.7.2 H. 1994^) J: -:>r:^jSSSl7-l 
tt^f^KistfeO. Txp^:^;:^ . +i.bxS^fef^DNA 
^ y -*?#/Co 

[ 0 0 5 6 ] rx^'^::^x • ^^i' trxcDrKyxx 
-r JUm^Smae-?-*^^ D N AKit^rff-S /cd6(D:7*o 
-:/?rlBS{L'/co cti^-ctc^enr l^-S^a^Z)^ y x 
::^f')\ym^mm(DT ^ ^S^i^ijr J: < ffi^snr c^^ 2 

S L T 5 ' -CC(C/GXCCC/G)TGCATCAAa/C)MC:rCr/A) (T/C) 
TACT/OATC-3* (^J»-^7) . aO'S'-CG/OACXICACG/ 
QGCCC/C)GTCCACA/QTC(G/C)GGCCACCA-3' (BS3^lJ## 
8) ■C^$n^2aS(7:>:t'y::^^i57U:?h^ F^T^^L 

[0 05 7 ] cne.©^yrf57>7u:f^F*y^>r'7- 
L/cPCRffiKicfcor^yxxf - ^us^ssae^^gp 

^tfltSO/Co PCRti. 94°Cr3C«?. 50-C"t?30^^:R2>'72 

VX 6(^^(7) J5Jj;:§r l1f'r^;Uc!:Lrcn^30-!fH^^ ;utf 

[0 05 8 ] f#6n;^cr7'u-^$:fflCirTXD-t:^;^ • 
+ t^xg^&ft: D N A ^ y -3!)^ 6 n n x-yN ^ y 

y -if - 5> 3 >fi6^ cfc o r ^ y X f-^um^pfgiSG 

^■?:^t^:7*^X^ F^W-r^;AcJ»a?:m!SIU/c„ C(Di^ 
Bia*^6r;U*ryStcJ:-pr7'^x^ F^rHHR-r-SCi 
r y X -r^^Ufi^B^itfcT-^r-^tf D N A 
/Co CO»T)tOBqlII-EcoRI »r>tCCOliT-y*>:^'-iSCC 



1 




XL 

[0 0 5 9 ] ^ c(DffiSse?ijccoi^rtaiHitt«is^ 
*tf-:»/ctSm. ccDs.zkbpCOtSSie^jocfiCcti. Be?»js 

[ 0 0 6 0 ] */c. mm^QXktxovm^ti^^mm 
yij^wr-sffK-tcijc^r. ±iai785bpcDiasi2?»j(DTsa 

^C#aET^405bp (Dite^- (OR F 3 ) R{>'K^i^M^S 
1^. MO'tC±aE&C#SE-r'£>354bp (Ditfc^ (ORF 1 ) 
RCX^Pgp^igi^IalSL/c. ORF KDmsK^j^se 
?'J#-^3. ORF UC J: On^ K5n€>T 5 y^KF'J^ 
ie?'J#^ 4 CC, O R F 3 ©«Si2?'J^ie?fJS-^ 5 . O R 

F 3 4c<fc 0 =2 - K 5 n-ST ^ ^Kie?U^is?«J#-^ 6 cc^ 

To CCr. ORF 3«d<i;x>^r-;l/Sfe^^tCM-^-r'5 
^;l/-Co At K'^^^f-'fe'^n- h'-r-SSeT-cDfe 
c^)r*€>o ^Ur. ORF 3tcj:0:3- KSn^T^ 

■fe'rStt. ^cc (R) -i^Me^x^-rjb-c o At K^^ 

fe^*^*£SM^{i03Oio- 3i87# a r ^ 5o 
[0 06 1 ] CUSfifl^Z] T>H/^yy^-;< • ho:? 

lUse^airiHi^^n/c^^iaipMi^. orfk d<';x 

;^-rJl/fi^B^i8e^. 0RF3SOl£^^?*£^^>&$ 

t^Bqlll-EcoRI K>tCDBqHlgB{4^EcoRI'; 
rEcoRlSUfii L/. 3.2l<bp(?:>EcoRI-EcoRI»Tit ( E E 3 2 

^'C^m:^W^y''7:^ 5 KpJRD2l5(ATCC37533)CCfflA 

h n :7 r X PHB-4 ( DSM541) ^) X :^f')i'^f&mXim 

[0 0 6 2 ] r3^j::b^. ^-r, CCDffl^x.r'*^;^ ^ 
fflC^r;^J®®si7^i »^S{b;^>»U>-'i; A;S^cJ:orffJM 

n7rXPHB^4 tt^LBlgftfei.Sml cp. 30-Cr$?^^SS 

SL/Co C<7>ilftS^ffi^MBF^55««! (0.9 %y> 

mr.'Tbv^j^. o.x5%v>m—tjvoMs. o.o5%±a<fc 

r>'tX'i7A. 0.5 ^I?;!^^ h-X. 1.5 %^3^, 0.3fn 

[0 06 3] mm^:Kmm^(Dzf^:^ 5 h'i>^TJii3 v y 




(7) Itll^ 1 0- 1 0868 2 

12 

^t'^;;^ . hpr'TXAC3 2*tj (felT. AC32*|c 
iO?^) ^mtco 3^cC*>\ AC3 2*|c«. X^ri5^^ 
X^XllJJJB5f5SB^{C. PERM P- 15786<!:L-C§FI^5n 

[0 06 4] S6iC^^:t'Jn5?^U:t^K?:«t.^/cg|5 
fll[#SW^SS (Currnt Protocolsin Molecular Biolo 
qy.l^. 8.1.1 M, 1994^) ^J:oT:EE3 29Sr^^<D 

ORF imo^o^wrmc^ti^timfmmBiqMi mni^ 

10 ^AL/. Bqlii-Bqlii WrK-^^^^S-ti-'SCi^iorO 

RF l&Bl^f)^X^UtcWi>^^i'^mL. y'^^^ KpJRD 
215 tCjfAU/Co CCDM^x.:?-^':^^ K^ffil^r. ±iE 

AC 3 2 1 Wt^^. 

[006 5] mm(fC. siiiLmmm^mm('CX-:>xE e 3 
2 KM-^cDo R F 3 a{nT-©f?scc-eti-ens»ms^Ba 

ntil^&^mXL. BarnHZ-BarrHlirK'^^^S-tt^Cc!:^ 
cl:-:>rORF3ite^^^iX^l./cBf>t*f^^b. r?*^:^ 
20 5 KpjRD2i5 «:JfAL//Co ccDfflgS;^:?''^:?^ 5 F*mc> 

WTACS 2 S^tiPf-^io 
[0 06 6] IlJIiCC. EE3 2BT>t*(?:>ORF lite^ 

(DM^^c^n-en^iM^^Bgiii gp{4^. or f site 

^CDM^CC-en^nflJRSS^BamHlgpa^^AL. Bqlll 
-Bglll »rjt*5ckC/BamHI-Banm»rit*i^$ -ti^ C i CC 
<fc-orORF iae^*5<J:D^'ORF3jifc^-7&^StC^>^^fe 
L/c»rjt«:<^U. :7'^>^5 Kp3RD2i5 ^CjfAL/Co C 

U/Co mbtitcBm^^f^^. «TAC3 2 1 3»<!:P? 
[0 067] .$/E>CC. EE 3 2BrK-^^M<bL. PGR 

lE^i(J^*S^^<i:CDr0l^»AL/Co PGRfi. 5'-Acrrccc 
cx:cTcajcrGTCcxn*Q\A-3'(BSy'J#-^aa) *5J:o*5'-cccatat 
40 QCCCTCATococx:crccr-3-(i2y»J#-^i2) SrrT^'^w'-^-i L 

94*Cr3(3i'\ 55"Cr30#:aO'72'C-C60f^?(DJS:i£4 1 

it <b br c 301^ ;Hf o /Co 

[0 06 8] CCDDNABTit*:^'^^^ Kp3RD215 CC# 

Ab. f#e>n/cffl»x.:7-^x5 K^ffli^r. iascDS^ 

«c;S:JI5MiE»U/Co f#6n/cffmejfe(*^. 1^TAG2 

[0 0 6 9 ] CI©fl«?iJ3D T;l/;^yy4^^X :i-hn7 
T xffjmi5lftf*CC J: ^ # y XX f^)\y^f& 
50 TJU;try y:^X • JL- hP:7TXH 1 6». PHB-4 



10 



13 

AC32^. AC32m. AC3 2 3tt. AC3 
2 1 3«c. AC2 9tt*, ^tl-en. 95mlCDMBigtti 

(0.9 % u ymni- h »; -i? A, 0.15% »; u ^ 

A. 0.05%ia{tr>-^:-:^'t? A) CC 1 mlCD 1 ^ >^ 

:^ h »;'i7A?:Jjax./ciSii&tcffiM0. ^.0^^:^:^^. 30 
•C-e^U/Co AC 3 2^. AC 3 2 1*^. ACS 2 3 
ACS 2 1 3«SC>^'AC2 9«CCO0^rti;^:^v^ 

i^>?:0.2q/LCOr^r^W5't±/Co 12B#ra1. 24B^r0l 36 

B#raiSLO'48B#FHiiigjaf^cc -en-en im-](Di%*^^>m 

[0 07 0] HI 6^. &0'AC3 2 1 StttCOl^Tti 

^Xntfi^ 30'C-C72B$P5e«L//Co i^cC^S. AC32 13 

[0071]H16^. AC32«. AC321tt. A 
C3 2 3». AC3 2 1 3mc-:)li'C{3:±i£OMBigife 
tC 1 m\<0 1 %^:7"^ >K:^ h y AJ&m/c^a&^fflli 
^P:^^^:^^^. so-C-CSSL/Co tj:^. ACS 2 
ACS2m. AC32S«?. RO'A C 3 2 1 3«c* 

^1 ;t^5'>SeS:JS^i:bfc7K»J 



C8) #ia^l 0- 1 08 68 2 

14 

f^tc^n-en i m^<o i %-^^7•^>K:^ h y •^7 A?r8s»au 

[0 07 2 ] ^m^. ^^C^^>i8CC J:orSi*«:|nIiRU. 

/Co l2:itB^10^30mqtC 2 mKDSgS" ^ -Jl^UiS. 
(15:85) i2m1cr>^Pn*;l/A^gSSnL/rffi1tb. 10 
0 Xn40 ^ralJjn^T'SCiCcj:^. ®ftl^^';x;:^r' 

h^^>^lia;^^f^B^Mcx:-i4A. b'^ u-;^^A^i 

GL1f X>X^tiSNEUTRA BOND-l ( :^7^ A:625m. t) 
'^AP^@0.25mm. ?gM^0.4 am) ^ffll^/Co SSt^ft 
li. iTJ^aSlOO -C^J^^S-C/^S^OiiSr^Sb/Co ft 
htxtcm^^mi. *2. *5<l:D^^3CC*'ro 
[0 07 3 ] 

[an 



20 



3HB 3HH 

(^;i>'%) 



H16 


3.00 


86 


100 


0 


PH&-4 


0-80 


0 






AC32 


0.99 


33 


78 


23 


AC321 


2-85 


92 


87 


13 


AC3Z3 


2.85 


92 


88 


12 


AC3213 


3.64 


96 


85 


15 


AC29 


3.20 


94 


92 


8 



3HB: Z-tL\^u^i/zf^\/—h. 3HH : 3 - 1 Ka^^i/^^^ifyji— N 



[0 07 4] 



^2 









1 7ifUaiXx;i^ffi^ 
3HB 3HH 

(^;i/%) 


H16 


4.27 
3.57 
4.13 
4.06 


79 
81 
79 
82 


100 0 
100 0 
100 0 
100 0 


AC3213 ^U— 


3.54 
3.60 
3.58 
2.22 


76 
77 

81 
70 


96 4 

95 5 

96 4 
96 4 



3HB: 3-t Kn#i/3^^b— 3HH: S-\i[^u^i/^^'*^Js.'--h 



[0 07 5] 



[^3] 



15 



C9) 



S3 ^^^>K^J^M^Ufc;K'Ja:XT-;U^ 
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3HB 3HV 3HHp 


H16 


2.50 


60 


50 50 0 


AC32 


0.77 


7 


30 67 5 


AC321 


1.67 


55 


46 52 2 


AC323 


1.27 


40 


48 45 7 


AC3Z13 


2.76 


67 


44 48 8 



3HB : 3-t KP=^i>:/^U— 3HV 
3HHp: 3-\i[^U^t^^:f^ ^JL^h 



[0 07 6] :ti:^^l^mimMmtLfcm^. mifiC^ 10 
H 1 6*feT'«^'; (3 -b Ku=^>>:/^lx- h) ;i^^A}< 

u-^-^^^-r-So cti«H 1 efti^^wrs^i^yxj^T^- 
^^i/fi^^^^jj^^ifc 6c7:)3HH (3-fc:K P ^i^-s=^-t^ 

^ ^ p h b -4 »-c«^SMSI«: <J: o 

PHB-4*|c(CTxn^:f X • ^-t- 
fc^xS5S(D?J< V x;;^ r-;US^P^jte^*^tf E E 3 2 
»r>t^^AL/cAC3 2*^rJi3HH (3-bKn + S^ 20 
-.^ifyx-- h) ^^22^;u%(D:J^'; O-tFa + v- 
:/^u- h-3 t Fci^t^^=f1^^x-h) 

s^&ft (p(3HB-co-3HH) ) ^^mmttmM$>tcKi 33mm 

[0077]$e>^C. AC32m. AC323^. A 
C3 2 1 3^r{i3 HH^^12-15^;U%CDP(3HB-co-3 
HH) ^92'-96Sfi%Sab. ORFliSfiT-. ORF3 
ite^-. *>€>C^i^cr)M:^€:^^3l±'5C<b'C5j<'jxx 

[007 8] */c. »Ab/c3i<yx;^x;US^BSiSe 30 

• hD>'TXS*<7)^5©tCg^L//cAC 2 9»r 
fev 94aS%COP(3HB-co-3HH) ^SSb. S^tDS^tC-S 

[0 07 9] ft*>^K';xXf-JHR¥cDii5C^AC3 2 1 3 

motet C^. ^2CC^-rJ:^CC3HH^^4'-5^;U% 
(DP(3HB-co-3HH) * 76'^8UM:%§a C//Co W^iA('Cm, 

Lr*j3 HH^^4^;U%CDP(3HB-co-3HH) ^70SS% 

(3 - fc Kp=^^>:/^U- h) 7i'^^^'J'r-(0^^^f& 
(008 0]^c:fo. rxn^:t-x • =^i'b'xFA4 4 0 

CO-3HH) ^s«-r^ci)&5«^$nri^-5 (!^Ba¥7-26 

C/r96Sfi%CDP(3HB-co-3HH) S/c@esi>r5ffi'C* 

^m^m^mmmt bX76smM%<op ( 3 h e - c so 



(3HB-CO-3HH) ^^?£«Si?>r 

[0 08 1 ] -^y'^iym^^mtLfcm-^. a2tc^ 

H16«rti3i<U (3-b Kn + ix^y^u- h-3-b 
Kn^-i^vN-y U-h) ^tS^ (PC3HB-CO-3HO) 
^r^o cntiHie^OWT'S^ Vxxf*;ufi^^^ts 
^l^7cD3HHp (3-b Kn+^V-v^^'^yx-F) 
^SMiLftC^/c»t>-C*5« PHB-4*fcCCTxn^:f 
X • +i'bxS3fc(D:i<';xX7^;US^^ite-T-^^tf 
EE32»r>i'^«AL//cAC32ttr{i3HHp^*5 
^>»b%<D^U (3-b KP=<^->y^U- h-3-fc Kp 
+^>>'^*1; f.-3 -t KP+^>-^>^t>yx- h) H5E 

J^^tt (P(3HB^o-3H^-CO-3HHp)) 4f£i*B<*SJI^ . 
/cO 7MM%SSL/c„ 

[0 08 2 ] $^CC. AC3 2 Itt. AC 3 2 3*^. A 
C3 2 1 3«^rtJ3 HHp^*2'-8^;l/%CC)P(3HB-co 
-3HV-co~3HHp)^40^67Sfi%§aOv ORFliiG 

[0 08 3] Cla6<DJS*30^6. rxp^:^x • ^t'b' 
xS3RcDJi<yxxf-;l.fi^^{i]^?S4-'7cD3-b K 
p^vt;U:^>^^^>'-7-^x^^ h t-r^i^m^^'J 
xxf*;u^^^r-5>c<h3&ir#-5>i#^^o 

[0 084] ClliS«^4D ORF3(D®g6|a]^ 
EE 3 2»rit^^<hUr. PC RftOCcfcoTORF 3 
ite^^rlfifib. ^^:/^X^ KPET-3a 
x>ttS) cDT7:7*P't-^-T^CCifAU/Co PCR« 

5' -OCCATATCyVGCCXIACMTCCCrGGAACrAC^ (ffiJ^JS"^ 
13) :fej:0'5' -CrCOIATCCGCCCCTGCrrAACXXIAOCTTC-S ^ 

(ffi?»J§-^14) ^y^-YV-cbUT. 95-Cr6(«.\ 68'C 
-C3C«?CDJSJ£* l1^-<^'>'^<tLr251f*-<^;Hf-:>/c„ ft 
htlfcy'^y^^ K^fflC>r;^SBL21(DE3) ^ 

T. NB3«^<h-r€>o 

[0 08 5] NB3«^100ml ©L Big«6"C30*C. 4B# 

PhUSSL. 'rvypb;l/^:t;^'^i^hb*^^5>K (ipr 
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(10) ^^pa^io 

18 

* At K^^— fe'rStt*i«iai$ti/c, 
[0 08 6] 



108 68 2 



L21^E3) *Jc/PET-3a 



0 

1700 



[0087]x-/-r;U-CoAt ^^--fe'SlitJi^ P 
M) . 2Mie^CD*fflCCf4^^0R3fea^^t (263nm ) 
OrCii^cCl^rj^ hn-;U:7'5X a KP ET - 3 a^^A 

[0 0 8 8 ]^CT. -/;l/-C o A t K^^--fe' ^ 



M) ^cJ:^r^>^^•i5'^^g|i3$■^^, x^^r^u-coAt 

h y A - 4< y r ^ y ;i/r ^ Fy;ummi*»j^#T:^^ 

j&^o/co S/c^5tC7S-r<J:^k:l:brgtt*S^3fmc|Bi±s 

[0 08 9 ] 
[«5] 



^5 xy-r;i/-CoAfc:K5^— ^JtS^ 



1700 
5100 



[0 0 9 0] ffen/cfSMxyn'JU-Co At F^^- 

^6 r^^ms^o:)im 



[0 09 1 ] 

[^6] 



^tSxy Y;i^CoAt K 5 31 -if 



SAQSLEVGQKARLSKRFGAA 
MSAQSLEVGQKARLSKRPGAA 



[0 092] CCOCti)^^^. ORF 3;ii^xy w^jU-C o 
At K-^^— fe'^rj- KUrCi-SCiT&^fillgr^/Co Me 

fc. ORF 3tcn- K^n-2>xy-r.FU-C o At 

[0 09 3] rSMifi'J^ojgj£;i§?g^ ( s ) - 3 - 1 F a 
+i^>^^»;;U-Co AT^t FPy:^— fe' (i^^vttSf) 

(^if5?rSao.2 :xx.;; h/ml ) igg^t^x ^>T ^ F 
rf=^x>>?^i:7U^^F (NAD + ) (SJ^?fiS0.5iTW 

) *:8^SD^4i. xy >f ;l/-c o A t F^^-'fe'Oi^ 
(s) -ft^^wr^nt^, ^^L/c (S) -3 
- t Fa + i^:/^'J;l/-C o Atif^t F^y:^-42C[)^^ 



30^- o r N A D + (JStcS N a D H^&S^^ L . 340n 

m (fcnmmrm^^^o^. i»ccxy>r;i.-coAt f 

^^-i27&^ (R) -{^^m&lX^tl^t. NADH«^^ 

[ 0 0 9 4 ] ^TCCiKfJ: ^tc. OR F 3 CCn- F^ti 
^x^Y^U-C o At F^^-'fe'^ffili/clS'^r«. 34 
Onm (D®3t:i^fbtix y C o A t F ^^--^mm 
lE(Dm^tti3:tA.tmCr:$>'z>tcf)\ rPSSO (S) 
Sfl^x^-<;l/-Co At F'^i5?--t2 (S^^^ttM) ^ffl 
C^/c«^r«. NADH(D^CCf*^R7feS^{b:05Me> 
40 n/Co 

[0095] 

[*7] 



^7 l^>aO340nii tC*?t:t^«3^jg^^b 



0RF3 



0.045 
0.047 
0.146 



[0 09 6] CCDjfSm:?&^6. ffil^x./ JU- C o A t F oAt F ^ a?--fef*n-- F LTC^^C i;i>5i»;&>-:>/c, 

(R) -<*i|#Sa«jr4)-5>C<i:*5B^6*^<i:/j:-^ [0 09 7 ] 

/Co SEor. 0RF3ti (R) -i*!^gfi«jx>'H^;i.-c 50 mmom^] *:|£B^CCJ:0. ^J<y x>^7^;ua^S5^ji 



(XL) 



19 



ia?'JC>S? : 1 78 5 
ISS^lCOmM : qenanic DMA 



10-108682 



20 



er-. KiteT-^^tfffl^x.-^^^-. mm.i^^^^^ * [0 0 9 8 ] 
CO-3HH) ^^^<fc < ^^pjfigr * 6.^.rWffi-c* So * 

: 

ATG AO: CAA CCA TCr TAT GCjC CCG CTC TTC GAG GOC CTG GCC CAC TAC 48 
Met Ser Gin Pro Ser Tyr Gly Pro Leu Phe Glu Ala Leu Ala His Tyr 

15 10 15 

AAT GAC AAC CTG CTG GCC ATG GCC AAG GCC CAG ACA GAG CGC ACC GCC 96 
Asn Asp Lys Leu Leu Ala Met Ala Lys Ala Gin Thr Glu Arg Thr Ala 

20 25 30 

CAG GCC CTG CTG CAG ACC AAT CTG GAC GAT CTG CGC CAG CTG CTG GAG 144 
Gin Ala Leu Leu Gin Thr Asn Leu Asp Asp Leu Gly Gin Val Leu Glu 

35 40 45 

CAG CGC ACC CAG CAA CCC TGG CAG CTG ATC CAG GCC CAG ATG AAC TGG 192 
Gin Gly Ser Gin Gin Pro Trp Gin Leu He Gin Ala Gin Met Asn Trp 

50 55 60 

TGG CAC GAT CAG CrC AAG CTG ATG CAG CAC ACC CTG CTC AAA AGC OCA 240 
Trp Gin Asp Gin Leu Lys Leu Met Gin His Ihr Leu Leu Lys Ser Ala 
65 70 75 80 

GCC CAC CCG AGC GAG CCG CTG ATC ACC CCG GAG CGC AGC GAT CGC CGC 288 
Gly Gin Pro Ser Glu Pro Val lie Thr Pro Glu Arg Ser Asp Arg Arg 

85 90 95 

TTC AAC GCC GAG GCC TGG AGC GAA CAA CCC ATC TAT GAC TAC CTC AAG 336 
Fhe Lys Ala Glu Ala Trp Ser Glu Gin Pro He Tyr Asp Tyr Leu Lys 

100 105 110 

CAC TCC TAC CTG CTC ACC GCC ACC CAC CTG CTG GCC TCG CTG GAT GCC 384 
Gin Ser Tyr Leu Leu Thr Ala Arg His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

CTG GAC CGC GTC CCC CAG AAC AGG CCG GAG CCG CTG CCT TTC TTC ACC 432 
Leu Glu Gly Val Pro Gin Lys Ser Arg Glu Arg Leu Arg Phe Phe Thr 

130 135 140 

CGC CAC TAC CTC AAC GCC ATC GCC CCC ACC AAC TTC CTG GCC ACC AAC 480 
Arq Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

CCC GAC CTG CTC AAG CTG ACC CTG GAG TCC GAC CGC CAG AAC CTG CTG 528 
Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser Asp Gly Gin Asn Leu Val 

165 170 175 

CGC GGA CTG GCC CTC TTC GCX: GAG GAT CTG GAC CGC AGC GCC GAT CAC 576 
Arg Gly Leu Ala Leu Leu Ala Glu Asp Leu Glu Arg Ser Ala Asp Gin 

180 185 190 

CTC AAC ATC CGC CTG ACC GAC GAA TCC GCC TTC GAC CTC CGC CCG GAT 624 
Leu Asn He Arg Leu Thr Asp Glu Ser Ala Phe Glu Leu Gly Arg Asp 

195 200 205 

CTG GCC CTC ACC CCC GCC COG CTG CTG CAG OGC ACC GAG CTC TAT GAG 672 
Leu Ala Leu Thr Pro Gly Arg Val Val Gin Arg Thr Glu Leu Tyr Glu 



960 



1008 



1056 



(12) it$ga¥l 0- 1 08 68 2 

21 ^2 
210 215 220 

CrC ATT CAC TAC AGC CCG ACT ACC GAG ACG GTG GGC AAG ACA CCT CFG 720 
Leu He Gin Tyr Ser Pro Thr Thr Glu Thr V^l Gly Lys Thr Pro Val 
225 230 235 240 

CTG ATA GTG CCG CCC TTC ATC AAC AAG TAC TAC ATC ATG CAC ATG CGG 768 
Leu ne Val Pro Pro Phe He Asn Lys Tyr Tyr He Met Asp Met Arq 

245 250 255 

GCC CAG AAC TCC CTG GTC GCC TGG CTG CTC QCC CAC GGC CAG ACG GTA 816 
Pro Gin Asn Ser Leu Val Ala Trp Leu Val Ala Gin Gly Gin Thr Val 

260 265 270 

TTC ATG ATC TCC TGG CGC AAC CCG GGC CTG QCC CAG GCC CAA ATC G^T 864 
Phe Met He Ser Trp Arq Asn Pro Gly Val Ala Gin Ala Gin He Asp 

275 280 285 

CTC G/VC GAC TAC GTG CTG GAT GGC CTC ATC GCC GCC CTG GAC GGC CTG 912 
Leu Asp Asp Tyr Val Val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

OVG COG GCC ACC GGC GAG CGG GAG CTG CAC GQC ATC GGC TAC TCC ATC 

Glu Ala Ala TTir Gly C\u Arq Glu Val His Gly He Gly Tyr Cys He 
305 310 315 320 

GGC GGC ACC GCC CTG TGG CTC CCC ATG GGC TGG CTG GCG GCC CGG CGC 
Gly Gly Thr Ala Leu Ser Leu Ala Met Gly Trp Leu Ala Ala Arq Arq 

325 330 335 

CAG AAG CAG CGG GTG GGC ACC GCC ACC CTG TTC ACT ACC CTG CTG GAC 
Gin Lys Gin Arq Val Arq Thr Ala Thr Leu Phe Thr Thr Leu Leu Asp 

340 345 350 

TTC TCC CAC CCC GGG GAG CTT GGC ATC TTC ATC CAC GAG CCC ATC ATA 1104 
Phe Ser Gin Pro Gly Glu Leu Gly He Phe He His Glu Pro He He 

355 360 365 

QCC GGC CTC GAG GCG CAA AAT GAG GCC AAG GGC ATC ATG GAC GGG GGC 1152 
Ala Ala Leu Glu Ala Gin Asn Glu Ala Lys ay He Met Asp Gly Arq 

370 375 380 

CAC CTG GCG GTC TCC TTC AGC CTG CTG CGG GAG AAC AGC CTC TAC TGG 1200 
Gin Leu Ala Val Ser Phe Ser Leu Leu Arq Glu Asn Ser Leu Tyr Trp 
385 390 395 400 

AAC TAC TAC ATC GAC AGC TAC CTC AAG GGT CAG AGC CCG GTG GCC TTC 1248 
Asn Tyr Tyr He Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Fhe 

405 410 415 

GAT CTG CTG CAC TGG AAC AGC GAC AGC ACC AAT CTG GCG GGC AAG ACC 1296 
Asp Leu Leu His Trp Asn Ser Asp Ser Thr Asn Val Ala Gly Lys Thr 

420 42 5 430 

CAC AAC AGC CTG CTG CGC CCT CTC TAC CTG GAC AAC CAG CTG CTG AAG 1344 
His Asn Ser Leu Leu Arq Arq Leu Tyr Leu Glu Asn Gin Leu Val Lys 

435 440 445 

GGG GAG CTC AAG ATC CGC AAC ACC CGC ATC GAT CTC GGC AAG CTG AAG 1392 
Gly Glu Leu Lys He Arq Asn Thr Arq He Asp Leu Gly Lys Val Lys 

450 455 460 

ACC CCT CTG CTG CTG CTG TCC GCG CTG GAC GAT CAC ATC GCC CTC TGG 1440 
Thr Pro Val Leu Leu Val Ser Ala Val Asp Asp His He Ala Leu Trp 
465 470 475 480 

CAC CGC ACC TGG CAC GGC ATG AAG CTG TTT GGC GGG GAC CAC CGC TTC 1488 



(13) mmW- 1 0- 1 08 68 2 

23 24 
Gin Gly Ihr Trp Gin Gly Met Lys Leu Phe Gly Cly Glu Gin Arq Phe 

485 490 495 

CrC CrC GCG gag TCC OOC OVC ATC GCC GGC ATC ATC AAC CCG CCG GCC 1536 
Leu Leu Ala Glu Ser Gly His He Ala Gly He He Asn Pro Pro Ala 

500 505 510 

GCC AAC AAG TAC GGC TTC TGG CAC AAC GGC GCC GAG GCC GAG AGC CCG 1584 
Ala Asn Lys Tyr Gly Phe Trp His Asn Cly Ala Glu Ala Glu Ser Pro 

515 520 525 

G^G AGC TGG CTG GCA OGG GCG ACG CAC CAG GOC GGC TCC TGG TGG CCC 1632 
Glu Ser Trp Leu Ala Gly Ala Thr His Gin Gly Gly Ser Trp Trp Pro 

530 535 540 

G^G ATG ATG GGC TIT ATC CAG AAC CGT GAC GAA GOG TCA GAG CCC GTC 1680 
Glu Met Met Gly Phe He Gin Asn Arq Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

CCC GCG CGG GTC CCG GAG GAA CCC CTG GCC CCC GCC CCC GGC CAC TAT 1728 
Pro Ala Arq Val Pro Glu Glu Gly Leu Ala Pro Ala Pro Gly His Tyr 

565 570 575 

GTC AAG GTG CGG CTC MC CCC GTG TTT GCC TGC CCA ACA GAG GAG G^C 1776 
Val Lys Val Arq Leu Asn Pro Val Phe Ala Cys Pro Thr Glu Glu Asp 

580 585 590 

OCC GCA TGA 1785 
Ala Ala 

[0 09 9 ] : 2 * : m.im 

ie?'J : 

Met Ser Gin Pro Ser Tyr Gly Pro Leu Phe Glu Ala Leu Ala His Tyr 

15 10 15 

Asn Asp Lys Leu Leu Ala Met Ala Lys Ala Gin Thr Glu Arq Thr Ala 

20 25 30 

Gin Ala Leu Leu Gin Thr Asn Leu Asp Asp Leu Gly Gin Val Leu Glu 

35 40 45 

Gin Gly Ser Gin Gin Pro Trp Gin Leu He Cln Ala Gin Met Asn Trp 

50 55 60 

Trp Gin Asp Gin Leu Lys Leu Met Gin His Thr Leu Leu Lys Ser Ala 
65 70 75 80 

Gly Gin Pro Ser Glu Pro Val He Thr Pro Glu Arq Ser Asp Arq Arq 

85 90 95 

Phe Lys Ala Glu Ala Trp Ser Glu Gin Pro He Tyr Asp Tyr Leu Lys 

100 105 no 

Gin Ser Tyr Leu Leu Thr Ala Arq His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

Leu Glu Gly Val Pro Gin Lys Ser Arq Glu Arq Leu Arq Phe Phe Thr 

130 135 140 

Arq Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser' Asp Gly Gin Asn Leu Val 

165 170 175 

Arq Gly Leu Ala Leu Leu Ala Glu Asp Leu Glu Arq Ser Ala Asp Gin 
180 185 190 




(14) 
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25 



26 



Leu Asn lie Arq Leu Thr Asp Glu Ser Ala Phe G1u Leu Gly Arg Asp 

195 200 205 

Leu Ala Leu Thr Pro Gly Arq Val Val Gin Arq Thr Glu Leu Tyr Glu 

210 215 220 

Leu He Gin Tyr Ser Pro Thr Thr Glu Thr Val Gly Lys Thr Pro Val 
225 230 235 240 

Leu He Val Pro Pro Fhe He Asn Lys Tyr Tyr He Met Asp Met Arq 

245 250 255 

Pro Gin Asn Ser Leu Val Ala Trp Leu Val Ala Gin Gly Gin Thr Val 

260 265 270 

Phe Met He Ser Trp Arq Asn Pro Gly Val Ala Gin Ala Gin He Asp 

275 280 285 

Leu Asp Asp Tyr Val Val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

Glu Ala Ala Thr Gly Glu Arg Glu Val His ay He Gly Tyr Cys He 
305 310 315 320 

Gly Gly Thr Ala Leu Ser Leu Ala Met Gly Trp Leu Ala Ala Arg Arq 

325 330 335 

Gin Lvs Gin Arq Val Arq Thr Ala Thr Leu Phe Thr Thr Leu Leu Asp 

340 345 350 

Phe Ser Gin Pro Gly Glu Leu Gly He Phe He His Glu Pro He He 

355 360 365 

Ala Ala Leu Glu Ala an Asn Glu Ala Lys Gly He Met Asp Gly Arq 

370 375 380 

Gin Leu Ala Val Ser Phe Ser Leu Leu Arq a u Asn Ser Leu Tyr Trp 
385 390 395 400 

Asn Tyr Tyr He Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Phe 

405 410 415 

Asp Leu Leu His Trp Asn Ser Asp Ser Thr Asn Val Ala Gly Lys Thr 

420 425 430 

His Asn Ser Leu Leu Arq Arq Leu Tyr Leu Glu Asn Gin Leu Val Lys 

435 440 445 

ay Glu Leu Lys He Arg Asn Thr Arq He Asp Leu ay Lys Val Lys 

450 455 460 

Thr Pro Val Leu Leu Val Ser Ala Val Asp Asp His He Ala Leu Trp 
465 470 475 480 

an Gly Thr Trp Gin ay Met Lys Leu Phe ay Gly Glu an Arq Phe 

485 490 495 

Leu Leu Ala Glu Ser ay His He Ala Gly He He Asn Pro Pro Ala 

500 505 510 

Ala Asn Lys Tyr Gly Phe Trp His Asn Gly Ala Glu Ala au Ser Pro 

515 520 525 

au Ser Trp Leu Ala ay Ala Thr His Gin Gly Gly Ser Trp Trp Pro 

530 535 540 

Glu Met Met Gly Phe He Gin Asn Arq Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

Pro Ala Arq Val Pro au Glu Gly Leu Ala Pro Ala Pro Gly His Tyr 

565 570 575 

Val Lys Val Arq Leu Asn Pro Val Phe Ala Cys Pro Thr Glu Glu Asp 



580 



585 



590 
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Ala Ala 

[0100] m^m^ : 3 

iS?*JcDS$ : 3 5 4 

ffi^J : 

ATG ATG AAT ATG 
Met Met Asn Met 
1 

TTC GCC GCC CCC 
Ph e Ala 
Gin Leu 



^mW- 1 0- 1 08 68 2 
28 



GAC CTG 
Asp Val 

5 

CrC ACQ 

A I a 
Leu 



ATG AAG 
He Lys 

COG TAG 
Pro 
A 1 a 
2 0 



* ^JOaS : genomic ONA 

AGG TTT ACG GAG GAG ATG CAA QGC 
Ser Phe Thr Glu Gin Met Gin Cly 

10 15 
AAC GAG CTG CTG GCC AGC AAC ATG 
Leu Thr Arg Tyr 
Ser Asn lie 



( 0 1 0 1 ] mim^ 4 

ie?iJCDg3 : 1 1 7 



48 



9 6 
Asn 

2 5 











30 








GAA 


CAG 


CTG 


ACC 


CGG 


TTG 


CAG 


CTG GCC 


TCC 


GCC 


AAC 


GCC 


TAG 


GCC 


GAA 


1 44 


G 1 u 


G 1 n 


Leu 


Th r 


Arg 


Leu 


G 1 n 


Leu Ala 


Ser 


A 1 a 


Asn 


A 1 a 


Tyr 


A 1 a 


G 1 u 








3 5 










4 0 








4 5 










CTG 


GGC 


CTC 


AAC 


CAG 


TTG 


CAG 


GCC GTG 


AGC 


AAG 


GTG 


CAG 


GAC 


ACC 


CAG 


1 9 2 


Leu 


G 1 y 


Leu 


Asn 


G 1 n 


Leu 


G 1 n 


Ala Val 


Ser 


Lys 


Va 1 


G 1 n 


Asp 


Th r 


G I n 






50 










5 5 








6 0 












AGC 


CTG 


GCG 


GCC 


CTG 


GGC 


ACA 


GTG CAA 


CTG 


GAG 


ACC 


GCC 


AGC 


CAG 


CTC 


2 40 


Ser 


Leu 


A I a 


A 1 a 


Leu 


G 1 y 


Thr 


Val Gin 


Leu 


G 1 u 


Th r 


A 1 a 


Ser 


G 1 n 


Leu 




6 5 










7 0 








7 5 










8 0 




TCC 


CGC 


CAG 


ATG 


CTG 


GAT 


GAC 


ATC CAG 


AAG 


CTG 


AGC 


GCC 


CTC 


GGC 


CAG 


2 88 


Ser 


Arg 


G 1 n 


Me t 


Leu 


Asp 


Asp 


lie Gin 


Lys 


Leu 


Ser 


A 1 a 


Leu 


G 1 y 


G 1 n 












8 5 








90 










9 5 






CAG 


TTC 


AAG 


GAA 


GAG 


CTG 


GAT 


GTC CTG 


ACC 


GCA 


GAC 


GGC 


ATC 


AAG 


AAA 


3 3 6 


G 1 n 


Phe 


Lys 


G 1 u 


G 1 u 


Leu 


Asp 


Val Leu 


Th r 


A 1 a 


Asp 


G 1 y 


I 1 e 


Lys 


Lys 










1 0 0 








1 0 5 










1 1 0 








AGC 


ACG 


GGC 


AAG 


GCC 


TGA 




















3 54 


Ser 


Thr 


G I y 


Lys 


A 1 a 









1 1 5 



50 



(16) 



29 



1 0- 1 08682 

30 



Me t 
Ph e 
1 

1 0 
Ph e 
G I n 



Me t 
Th r 



A 1 a 
Leu 



A s n Met 
G 1 u Gin 



G 1 u Gin 
S e r Ala 



Leu 
S e r 



S e r 
Leu 
6 5 

S e r 
Ly s 

9 0 
G 1 n 
Th r 



G 1 y 
L y s 
50 

Leu 
G 1 u 

7 5 
At s 
Leu 



Ph e 
A 1 a 



Ala Pro 
Leu Ala 
2 0 

Leu Th r 
A s n Ala 
3 5 

45 

Leu A s n 
Va 1 Gin 

6 0 

Ala Ala 
Th r Ala 



Asp Val lie Lys Ser 
Met Gin G 1 y 
5 

1 5 

Leu Thr Arg Tyr Asn 
Ser Asn lie 

2 5 

30 

Arg Leu Gin Leu Ala 
Tyr Ala G 1 u 

40 

Gin Leu Gin Ala Val 
Asp Thr Gin 
55 

Leu Gly Thr Val Gin 
Ser Gin Leu 



7 0 



80 



Ser Thr 



Gin Met 
Ser Ala 



Lys G 1 u 
Asp Gly 
1 0 0 

Gly Lys 
1 1 5 



Leu Asp Asp lie Gin 
Leu Gly Gin 



85 



9 5 



[0102] i2?»J#-^ 
ie^!jCD:g$ : 4 0 5 



Glu Leu Asp Val Leu 
lie Lys Lys 

1 0 5 

1 1 0 
A 1 a 

* i^iJCDaS : genonic DNA 



ATG 
GAG 
Me t 
G 1 n 
1 

1 0 
CGG 
TTC 
Arg 
Ph e 



TTC 
TTC 
Ph e 



AGC 


GCA 


CAA 


TCC 


CTG 


GAA 


GTA 


GGC 


AAG 


GCC 


CGT 


CTC 


AGC 


AAG 




48 


Ser 


A 1 a 


G 1 n 


Ser 


Leu 


G I u 


Va 1 


Gl y 


Lys 


A I a 


Arg 


Leu 
5 


Ser 


Lys 














1 5 








TTC 


GGG 


GCG 


GCG 


GAG 


GTA 


GCC 


GCC 


GCC 


GCG 


CTC 


TCG 


GAG 


GAC 




9 6 


Ph e 


G 1 y 


A 1 a 


A 1 a 


G 1 u 


Va 1 


A 1 a 


A 1 a 


A 1 a 


A 1 a 


Leu 


Ser 


G 1 u 


Asp 








2 0 










25 








3 0 










AAC 


CCC 


CTG 


CAC 


CTG 


GAC 


CCG 


GCC 


GCC 


GCC 


ACC 


ACG 


GCG 


TTC 




1 44 


Asn 


Pro 


Leu 


H i s 


Leu 


Asp 


Pro 


A 1 a 




31 

Ph e Ala 



GAG CGG 

CTC GCC 

G 1 u At s 

Leu Ala 
50 

CTG CTG 

GGG AGO 

Leu Leu 

G 1 y S e r 
6 5 

75 

AGO CTC 

GTC GGG 

S e r Leu 

Va 1 Gly 

9 0 

GAG GTG 

GAC AAG 

G 1 u Va I 

Asp L y s 



ACC ACC 

GCC CTC 

Thr Thr 

Ala Leu 



GCC GTG 

Ala V a 1 
1 30 

[0103] : 6 

Se^iJcDSS : 1 34 

mm : 

Met S e r 
Gin L y s 
1 

1 0 

A r g P h e 
Ph e Ala 



P h e A s n 




C17) ^mW- 1 0- 1 08 68 2 

32 



A 1 a 


Th r 


Thr 


A 1 a 


Phe 






3 5 










4 0 






4 5 












CCC 


ATA 


GTC 


CAC 


GGC 


ATG 


CTG 


AGC 


CTC 


TTC 


TCC 


GGG 




1 9 2 


Pro 


I 1 e 


V a I 


H i s 


G I y 


Me t 


Leu 


S e r 


Leu 


Ph e 


S e r 


G 1 y 














5 5 






6 0 














GGC 


CAG 


CAG 


TTG 


CCG 


GGC 


AAG 


ATC 


TAT 


CTG 


GGT 


CAA 




2 4 0 


G 1 y 


G 1 n 


G 1 n 


Leu 


Pro 


G 1 y 


L y s 


I 1 e 


Ty r 


Leu 


G 1 y 


G 1 n 












7 0 
















8 0 






AGC 


TTC 


AAG 


CTG 


CCG 


GTC 


TTT 


GAC 


GAG 


GTG 


ACG 


GCC 




2 8 8 


S e r 


Ph e 


L y s 


. Leu 


Pro 


Va 1 


Ph e 


Asp 


G 1 u 


Va 1 


Th r 


A 1 a 










8 5 
















9 5 








GAG 


GTG 


ACC 


GCC 


CTT 


CGC 


GAG 


CCC 


ATC 


GCC 


ACC 


CTG 




3 36 


G 1 u 


Va 1 


Thr 


A 1 a 


Leu 


A r g 


G 1 u 


Pro 


I 1 e 


A 1 a 


Th r 


Leu 








1 0 0 










1 0 5 






1 1 0 










CGC 


ATC 


TTC 


ACC 


CAA 


GGC 


GGC 


GCC 


GTG 


ACG 


GGG 


GAA 




3 84 


A r gr 


1 1 e 


Ph e 


Th r 


G 1 n 


G 1 y 


G 1 y 


A 1 a 


Va 1 


Th r 


G 1 y 


G 1 u 






1 1 5 










1 2 0 






1 2 5 












GTC 


AAG 


CTG 


CCT 


TAA 


















405 


V a 1 


L y s 


Leu 


Pro 












* 1 
























*40 










A 1 a 


G 1 n 


S e r 


Leu 


G 1 u 


Va 1 


G 1 y 


A I a 


Are 


Leu 


S e r 


L y s 










5 
















1 5 








G 1 y 


A 1 a 


A 1 a 


G 1 u 


V a 1 


A 1 a 


A 1 a 


A 1 a 


Leu 


S e r 


G 1 u 


Asp 








2 0 










25 






3 0 










Pro 


Leu 


H i s 


Leu 


Asp 


Pro 


A 1 a 



(18) 



33 
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34 



4 0 



Phe Ala Ala Th r Thr Ala Phe 
3 5 

4 5 

Glu Avg Pro lie Val His Gly Met Leu 
Leu Ala Ser Leu Phe Ser Gly 



5 0 



55 



6 0 



Leu Leu Gly Gin Gin Leu Pro Gly Ly s 
Gly Ser lie Tyr Leu Gly Gin 



6 5 



70 



7 5 



8 0 



Ser Leu Ser Phe Lys Leu Pro Val Phe 
Val Gly Asp Glu Val Thr Ala 

85 



9 0 



9 5 



Glu Val Glu Val Thr Ala Leu Arg Glu 
Asp Lys Pro lie Ala Thr Leu 



1 0 0 



1 0 5 



1 1 0 



Thr Thr Ars lie Phe Thr Gin Gly Gly 
Ala Leu Ala Val Thr Gly Glu 



1 1 5 

1 2 5 

Ala Val Val Lys Leu Pro 
1 3 0 



1 2 0 



[0104] ie?iJ#-^ : 7 

la^JCOSS : 2 7 



GCSCCSTGGA 
[0105] gB?«J#-^ : 8 
Se^iJCDS^ : 2 7 

ie^ij<oM : mm 

SACCCASCCS 
[0106] im^^ : 9 
|ffi^J(DS$ : 3 1 8 7 

^]<DWM : qenonic DMA 

acatctcx:ac 
gcagcagcxx: 
cagcgcggcg 
cxxicrtjccrc 

TTTACACCAA 

ACACGTAAAC 

crcTTcrccc 



TCAAYAACm YTAYATC 



27 

^lomm : f&om^ (^^DNA) 



CTTCCARTCSG GCCACCA 



27 



ItSi^^Tfa-^ : CDS 

U&iilS : 384.. 734 

mk^m-^E^ : CDS 
40 ??ffi{ag : 830..26aa 



CGGGGTGCTC GCCTGCXXCA CCGCCOCGAC GGCCAOGCXIC GACXj^ACCGA 
GAGAGCTTTC ATCOXATTC CTTCGCACTC TOVATGACCTT CCCACCCTAT 
CCCXTTGCGGC GAGCXXIGCOC CCXACCOVJT GCCTCACCTC TCCTCTGATC 
CACCCXXICTC GCrCACAAAA AAATTCAAAC AGAAATTAAC ATTTATOTCA 

ACCCCArrrc gttgcagaat cxtcaaaoct gtctttcaac acagcaacca 

ACGGATGACA TGCACTACCC CTAAGAAGGC CCGATTGGCC CACAACAACA 
CAACrCXiiAGA CCG ATG ATG AAT ATG GAC CTG ATC AAC AGC 
Met Met Asn Met Asp Val lie Lys Ser 



60 
120 
180 
240 
300 
360 
410 
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35 36 
1 5 

TTT ACC GAG GAG ATG CAA GOC TTC CCC GCC CCC CTC ACC CGC TAG AAC 458 
Phe Ihr Glu Gin Met Gin Gly Phe Ala Ala Pro Leu Thr Arq Tyr Asn 
10 15 20 25 

GAG CTG CTG GGC AGC AAC ATG GAA GAG CTG ACC CCC TTG GAG CTG OCG 506 
Gin Leu Leu Ala Ser Asn He Glu Gin Leu Ihr Arq Leu Gin Leu Ala 

30 35 40 

TGG GCC AAC GCC TAG GCC GAA CTG GCC CTG AAC CAC TTG GAG GCC CTG 554 
Ser Ala Asn Ala Tyr Ala Glu Leu Gly Leu Asn Gin Leu Gin Ala Val 

45 50 55 

ACC AAC CTG GAG GAG ACC GAG ACC CTG GCC GCC CTC GGC ACA CTG GAA 602 
Ser Lys Val Gin Asp Thr Gin Ser Leu Ala Ala Leu Gly Thr Val Gin 

60 65 70 

CTG GAG ACC GCC AGC CAG CTC TCC CGC GAG ATG CTG GAT GAG ATG GAG 650 
Leu Glu Thr Ala Ser Gin Leu Ser Arq Gin Mat Leu Asp Asp He Gin 

75 80 85 

AAG CPG AGC GCC CTC GOC CAG CAG TTC AAG GAA GAG CTC CAT CTC CTG 698 
Lys Leu Ser Ala Leu Gly Gin Gin Phe Lys Glu Glu Leu Asp Val Leu 
90 95 IDO 105 

ACC GCA GAC GGC ATG AAG AAA AGC ACG GOC AAG GCC TGATAACGCC 744 
Thr Ala Asp Gly He Lys Lys Ser Thr Gly Lys Ala 

no 115 
TCCCTGCCCG TTCGGGCAGC CACATCTCGC CATGAaCCA CQCTACGCGG TACTTOCCGC 804 
CrCGOCTCTG GCTGAACGAG ACCAC ATG ACC CAA GCA TCT TAT GGC CCG CTG 856 

Met Ser Gin Pro Ser Tyr Gly Pro Leu 
1 5 

TTC GAC GCC CTG GCC CAC TAG AAT GAC AAG CTG CTG GCC ATG GCC AAC 904 
Phe Glu Ala Leu Ala Kis Tyr Asn Asp Lys Leu Leu Ala Met Ala Lys 
10 15 20 25 

OCC CAG ACA CAG CGC ACC GCC CAG GCG CTG CTG CAC ACC AAT CTC GAC 952 
Ala Gin Thr Glu Arq Thr Ala Gin Ala Leu Leu Gin Thr Asn Leu Asp 

30 35 40 

GAT CTC GGC CAG CTG CTG GAC CAG GGC AGC 0\G CAA CCC TGG CAC CTC 1000 
Asp Leu Gly Gin Val Leu Glu Gin Gly Ser Gin Gin Pro Trp Gin Leu 

45 50 55 

ATG CAC CCC CAC ATG AAC TGG TCC CAG GAT CAG CTC AAG CTG ATG CAG 1048 
He Gin Ala Gin Met Asn Trp Trp Gin Asp Gin Leu Lys Leu Met Gin 

60 65 70 

CAC AGC CTG CTC AAA ACC GCA GGC CAG CCC AGC GAC CCC CTC ATG ACC 1096 
His Thr Leu Leu Lys Ser Ala Gly Gin Pro Ser Glu Pro Val He Thr 

75 80 85 

GCG GAC CGC AGC GAT CGC CGC TTC AAG GCC GAG GGC TGG AGC GAA CAA 1144 
Pro Glu Arq Ser Asp Arq Arq Phe Lys Ala Glu Ala Trp Ser Glu Gin 
90 95 100 105 

CCC ATC TAT GAC TAG CrC AAC CAG TCC TAG CTC CTC ACC CCC ACC CAC 1192 
Pro He Tyr Asp Tyr Leu Lvs Gin Ser Tyr Leu Leu Thr Ala Arq Wis 

no 115 120 

CTC CTG GCC TCC CTC CAT GCC CTC GAC GGC CTC CCC CAC AAC AGC CGC 1240 
Leu Leu Ala Ser Val Asp Ala Leu Glu Gly Val Pro Gin Lys Ser Arq 
125 130 135 



(20) 
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37 

GAG COG CTG CCT TTC TTC ACC CCC CAG TAG CTC Ml GCC ATG GCC CCC 
Glu Arq Leu Arg Phe Phe Thr Arq Gin Tyr Val Asn Ala Met Ala Pro 

140 145 150 

ACC AAC TTC CTG GCC ACC AAC CCC GAG CTG CTC AAC CTG ACC CTG GAG 
Ser Asn Phe Leu Ala Thr Asn Pro Glu Leu Leu Lys Leu Thr Leu CTu 

155 150 165 

TCC GAC GGC CAG AAC CTG CTG CGC GGA CTG GCC CTC TTC GCC GAG GAT 
Ser Asp Gly Gin Asn Leu Val Arq Gly Leu Ala Leu Leu Ala Glu Asp 
170 175 1^0 ^5 

CTG GAG CGC AGC GCC CAT CAG CTC AAC ATC CGC CTG ACC GAC GAA TCC 
Leu Glu Arq Ser Ala Asp Gin Leu Asn He Arq Leu Thr Asp Glu Ser 

190 195 200 

GCC TTC GAG CTC OGG COG GAT CTG GCC CTG ACC COG GGC CGG CTG CTG 
Ala Phe Glu Leu Gly Arq Asp Leu Ala Leu Ihr Pro Gly Arq Val Val 

205 210 215 

CAC COC ACC GAG CTC TAT GAG CTC ATT CAG TAG AGC CCG ACT ACC GAG 
Gin Arq Thr Glu Leu Tyr Glu Leu He Gin Tyr Ser Pro Thr Thr Glu 

220 225 230 

ACG CTG GGC AAG ACA CCT CTG CTG ATA CTG CCG CCC TTC ATC AAC AAG 
■Thr Val Gly Lys Thr Pro Val Leu He Val Pro Pro Phe He Asn Lys 

235 240 245 

TAC TAG ATC ATG GAC ATG COG CCC CAG AAC TCC CTG CTC GCC TGG CTG 
Tyr Tyr He Met Asp Met Arq Pro Gin Asn Ser Leu Val Ala Trp Leu 
250 255 260 265 

CrC GCC CAG GGC CAG ACG CTA TTC ATG ATC TCC TGG CGC AAC CCG GGC 
Val Ala Gin Gly Gin Thr Val Phe Met He Ser Trp Arq Asn Pro Gly 

270 275 280 

CTG GCC CAG CCC CAA ATC GAT CTC GAC GAC TAC CTG CTG GAT GGC CTC 
Val Ala cm Ala Gin He Asp Leu Asp Asp Tyr Val Val Asp Gly Val 

285 290 295 

ATC GCC GCC CTG GAC GCC CTG GAG GCG GCC ACC GGC GAG CGG GAG CTG 
He Ala Ala Leu Asp Gly Val Glu Ala Ala Thr Gly Glu Arq Glu Val 

300 305 310 

CAC GOC ATC GGC TAC TGG ATC GGC CGC ACC GCC CTG TCC CTC GCC ATG 
His Gly He Gly Tyr Cys He Gly Gly Thr Ala Leu Ser Leu Ala Met 

315 320 325 

GGC TOG CTG GCG GCG CGG CGC CAG AAG CAG CGC CTG CCC ACC GCC ACC 
Gly Trp Leu Ala Ala Arq Arq Gin Lys Gin Arq Val Arq Thr Ala Thr 
330 335 340 345 

CTG TTC ACT ACC CTG CTG GAC TTC TCC CAG CCC GOG GAG CTT GGC ATC 
Leu Phe Thr Thr Leu Leu Asp Phe Ser Gin Pro Gly Glu Leu Gly He 

350 355 360 

TTC ATC CAC GAG CCC ATC ATA GCG GCG CTC GAG GCG CAA AAT GAG GCC 
Phe He His Glu Pro He He Ala Ala Leu CTu Ala Gin Asn Glu Ala 

365 370 375 

AAG GOC ATC ATG GAC OGG CGC CAG CTG GCG CTC TCC TTC AGC CTG CTG 
Lys Gly He Met Asp CTy Arq Gin Leu Ala Val Ser Phe Ser Leu Leu 

380 385 390 

COG GAC AAC AGC CTC TAC TGG AAC TAC TAC ATC GAC ACC TAC CTC AAG 
Arq Glu Asn Ser Leu Tyr Trp Asn Tyr Tyr He Asp Ser Tyr Leu Lys 



38 



1288 



1336 



1384 



1432 



1480 



1528 



1576 



1624 



1672 



1720 



1768 



1816 



1864 



1912 



1960 



2008 



2056 
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39 40 
395 400 405 

Gcrr CAG Acc ccG crc occ rrc gat crc ctg cac tgg aac agc gag acc 2104 

Gly Gin Ser Pro Val Ala Phe Asp Leu Leu His Trp Asn Ser Asp Ser 

410 415 420 425 

ACC AAT CTG GCG GGC AAG ACC CAC AAC AGC CTG CTG CGC CGT CTC TAG 2152 

Thr Asn Val Ala Gly Lys Thr His Asn Ser Leu Leu Arp Arq Leu Tyr 

430 435 440 

crc GAG AAC CAC CTG GTG AAC GCG GAG CTC AAG ATC CGC AAC ACC CGC 2200 
Leu Glu Asn Gin Leu Val Lys Gly Glu Leu Lys lie Arq Asn Thr Arq 

445 450 455 

ATC GAT CTC GGC AAG CTG AAC ACC CCT CTG CTC CTC GTG TCG GCG GTG 2248 
lie Asp Leu Gly Lys Val Lys Thr Pro Val Leu Leu Val Ser Ala Val 

460 465 470 

GAG GAT CAC ATC GCC CTC TGG CAG GGC ACC TGG OVS GGC ATG AAC CTG 2296 
Asp Asp His He Ala Leu Trp Gin Gly Thr Trp Gin Gly Met Lys Leu 

475 480 485 

TTT GGC GGC GAG CAG GGC TTC CTC CTG GCG GAG TGC CGC CAC ATC GCC 2344 
Phe Gly Gly Glu Gin Arq Phe Leu Leu Ala Glu Ser Gly His He Ala 
490 495 500 505 

GGC ATC ATC AAC CCG GCG GCC GCC AAC AAG TAC GGC TTC TGG CAC AAC 2392 
Gly He He Asn Pro Pro Ala Ala Asn Lys Tyr Gly Phe Trp His Asn 

510 515 520 

GOG GCC GAG GCC GAG AOC COG GAG ACC TCG CTG CCA GCG GCG ACC CAC 2440 
Gly Ala Glu Ala Glu Ser Pro Glu Ser Trp Leu Ala Gly Ala Thr His 

525 530 535 

CAG GGC GGC TCC TGG TGG CCC GAG ATG ATG GGC TTT ATC CAC AAC CCT 2488 
Gin Gly Gly Ser Trp Trp Pro Glu Met Met Gly Phe He Gin Asn Arq 

540 545 550 

GAG GAA GCG TCA GAG CCC CTC CCC GCC CGC CTC CGC GAG GAA GGC CTG 2536 
Asp Glu Gly Ser Glu Pro Val Pro Ala Arq Val Pro Glu Glu Gly Leu 

555 560 565 

GCC CCC GCC CCC GGC CAC TAT CTC AAG CTG CGC CTC AAC CCC CTG TTT 2584 
Ala Pro Ala Pro Gly His Tyr Val Lys Val Arq Leu Asn Pro Val Phe 
570 575 580 585 

OCC TGC CCA ACA GAG GAG CAC CCC CCA TGACCGCACA ATCGCTGGAA 2631 
Ala Cys Pro Thr Glu Glu Asp Ala Ala 
590 

CTACCCCAGA AGGCCCCTCT CAGCAACCOG TTGGGCCCCG CGGAGCTAGC CGCCTTCCCC 2691 
OCCCTCTCGG AGGACTTCAA CCCCCTGCAC CTOGACCCGG CCTTCGGCGC CACCACGGCG 2751 
TTCGAGCGGC CCATAtJTCCA CGCGATGCTC CTCGCCAGCC TCTTCTCCCG GCTGCTTOGGC 2811 
CACCAGTTCC CCGGCAACGC CACCATCTAT CTCCCTCAAA CCCTCAGCTT CAAGCTCCCC 2871 
CrCTTTGTCG GGGACGACCT GACQGCCGAC CTGGAGCTGA CCGCCCTTCG CCACCACAAC 2931 
OCCATCGCCA CCCTGACCAC CCGCATCTTC ACOCAAOGCG GCGCCCTCGC CCTGACGGGG 2991 
GAAGCCCTGG TCAAGCTGCC TTAACCACCG GCGGCACGCA GGCACAATCA QCCCGQCCCC 3051 
TGCCCGCCTG ATTCTTCTCC CCCGCTCCOC TTGCCCCCTT TTTCCGOGCA ATTTCCCCCA 3111 
GGCCCTTTCC CTGCCCCGCC TAACTCCCTA AAATGGCCGC CCTCCCCTCT ACCCATTCAT 3171 
CTAGCTACAC GAATTC 3187 

[0107] m^mn : 1 0 m<Dm. : r.m^ 
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^^m-tME^ : CDS 



*l?ffifl[g : 2611.. 3012 



AGATCrCGAC CGGGGrGCTG GCCTGGGCCA CCCCCXX:GAG GOCCAGCGCG OVGCACCGA 60 

OCAGCACCXX GAGAGGTTTC ATCQGGATTC CTTGCCACrC TGVATGACGT GCCAGCCTAT 120 

CAGCGCCGCG CCGCTCCGCC CAGQGCGCGC CGGACCCACT GCCTTCAGCTC TCGTCTCATC 180 

GGCCrCCCrC GACGGGCGTC GCTGACAAAA AAATTCAAAC AGAAATTAAC ATTTATGrCA 240 

TTTACACCAA ACCGG\TTTG GTTOCAGMT GCTCAAACCT GTGTTTG^C AGAGCAAGCA 300 

ACACGTAAAC AOGGATGACA TGOSCTACCX GTAAGAAOGG CCGATTGGCC CACAACAACA 360 

CrCTTCTGCC GAACTOGAGA CCGATGATG\ ATATGGACCT GATCAAGAGC TTTACCGAGC 420 

ACATGCAAGG CTTCOCCCCC CCCCTCACCC CCTACAACCA GCTGCTGGCC AGCAACATCG 480 

AACAGCTGAC CCGCTTGCAG CTGGCCTCCG CCAACGCCTA CQCCGAACTG QGCCTCAACC 540 

AGTTGCAGGC CGTGAOCAAG CTGCAGGACA CCCAGAGOCT GGCGGCCCTG GCCACACTGC 600 

AACTGGAGAC CGCCAGCCAG CTCTCCCGCC AGATCCTOGA TGACATCCAG AAGCTGAGCG 660 

CCCTCGGCCA GCAGTTCAAG GAAGAGCTOG ATCTCCTGAC CQCAGACGGC ATCAAGAAAA 720 

GCACGGGCAA GGCCTGATAA CCCCrGGCTG CCGGrTCGOG CAOCCACATC TCCCCATGAC 780 

TCGACGCTAC GGGCTAGTTC CCGGCTCGGG TGTCCCrOVA CGAGAGCACA TCAGCCAACC 840 

ATCTTATGGC CCGCTGTTCG AGGCCCTGGC CCACTACAAT GACAACCTGC TGGCCATGGC 900 

CAAGGCCCAG ACAGAXGCA CCGCCCAGOC GCTGCTGCAG AOCAATCTGG ACGATCTGGG 960 

ccAGcrccrc gagcaggcca gccagcaacc crcGCAOcrc atccaggccc acatgaactg 1020 

GTGGCACGAT CAGCTCAAGC TGATGCAGCA CACCCTGCTC AAAAGCQCAG OCCACCCGAG 1080 
CGAGCCGCTG ATCACCCCGG AGCGCAGCGA TCGCCGCTTC AAGGCCGAGG CCTGGAQCGA 1140 
ACAACCCATC TATGAOACC TCAACCACTC CTACCTGCrC ACCGCCACCC ACCrOCTGGC 1200 
CrCGCTGGAT GCCCTGGAGG CCGTCCCCG\ GAAGAGCCCG C^GGCTGC Gl ! iCI ICAC 1260 
GCGCCACTAC GTCAACGCCA TGGGCCCCACi CAACTTCCTG GCCAGCAACC CCGAGCTGCT 1320 
CAAGCTGACC CTOGACTrCCG ACGGCCAGAA CCTCCTGCGC GGACTCGCCC TCTTGGCCGA 1380 
GGATCrCGAG CGCAGCGCCG ATCAGCTCAA CATCCGCGTG ACCGACGAAT CCGCCTTCGA 1440 
OCrCGGGCGG GATCTGGCCC TGACCCCGOG CCGGCTGCTG CACCGCACCG ACCTCTATGA 1500 
OCrCATTCAG TACAOCCCGA CTACCGAGAC GGTGGGCAAG ACACCrCTGC TGATAGTGCC 1560 
QCCCrnCATC AAGAAETTACT ACATCATGGA CATCCGGGCC CACAACTCCC TCCTCGCCTC 1620 
GCTGCTCGCC CAGGOCCACA CGGTATTCAT CATCTCCTGG COCAACCCGG OCCTGGCCCA 1680 
QGCCCAAATC GATCTCGACG ACTACCrGGT GGATGGCGTC ATCGCCGCCC TGGACGGCCT 1740 
GGAQGCGGCC ACCGQCGAGC GGGACGTGCA CGGCATCGGC T/CTGCATCG GCGGCACCGC 1800 
CCTCrCGCrC GCCATGGGCT GGCTGGCCQC CCQGCGCCAG AACCAGCGGG TGCGCACCGC 1860 
CACCCrCTTC ACrACCCTGC TGGACTTCrC CCACCCCGGG GACCTTGGCA TCTTCATCCA 1920 
OIAOCCCATC ATACCOGCGC TCGACGCGCA AAATGAGGCC AAOGGCATCA TGGACGGGCG 1980 
CCAGCTGGCC CTCTGCTTCA GCCTGCTCCC GGAGAACAOC CTCTACrGGA ACTACTACAT 2040 
CGACAOCTAC CTCAACGGTC AGAGCCCGCT GGCCTTCGAT CTGCTGCACr OGAACAXGA 2100 
CAGCACCAAT GTGGCGGGCA AGACCCACAA CAOCCTGCTG CGCCGTCTCr ACCTGGAGAA 2160 
CCAGCrccrC AAGCQOGACC TCAAGATCCG CAACACCGCC ATCGATCrCC GCAACCTGAA 2220 
GACCCCTGTG CTGCTGCTCT CGGCGCTCGA CGATCACATC GCCCrCTGGC ACXXCACCTG 2280 
QCAGGGCATG AAGCTGTTTG GCGGGGAGCA GCOCTTCCrC CTGGCGGACT CCGGCCACAT 2340 
GCCCGGCATC ATCAACCCGC CGGCCGCCAA CAACTACGGC TPCTCGCACA ACGGGGCCGA 2400 
GCCCGACAGC CCGGAGAGCT GGCTGGCAOG GGOGACGCAC CAXGCQGCT CCTGGTGGCC 2460 
CXAGATGATG GGCTTTATCC AGAACCCTGA CGAAGGCTCA GACCCCGTCC CCGCGCOGCT 2520 
GCCGGACGAA CGGCTGGCCC CCCCCCCCGG CCACTATGrC AACCTOCGOC TCAACCCCGT 2580 
CrrTGGCTGC CCAACAGAGG AGGACGCCOC ATG ACC GCA CAA TGC CTG GAA CTA 2634 



GGC CAC AAC GCC CCr CrC AGC AAC CGG TTC GGG GGC GCG GAG CTA OCC 2682 



Met Ser Ala Gin Ser Leu Glu Val 



1 



5 



(23) 
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43 44 
Cly Gin Lys Ala Arq Leu Ser Lys Arq Phe Gly Ala Ala Glu Val Ala 

10 15 20 

QCC TTC GCC GCG CTC TCG GAG GAC TTC AAC GCC CTC CAC CTG CAC CCG 2730 
Ala Phe Ala Ala Leu Ser Glu Asp Phe Asn Pro Leu His Leu Asp Pro 
25 30 35 40 

GCC TTC GCC GCC ACC ACG GCG TTC GAG CGG CTC ATA GTC CAC GGC ATG 2778 
Ala Phe Ala Ala Thr Thr Ala Phe Glu Arq Pro He Val His Gly Met 

45 50 55 

CTG CTC GCC AGC CTC TTC TCC GGG CTG CTG GGC CAG CAG TTG CCG GGC 2826 
Leu Leu Ala Ser Leu Phe Ser Gly Leu Leu Gly Gin Gin Leu Pro Gly 

60 65 70 

AAG GGC AGC ATC TAT CTG GCT CAA AGC CTC AGC TTC AAG CTG CCG GTC 2874 
Lys Gly Ser He Tyr Leu Gly Gin Ser Leu Ser Phe Lys Leu Pro Val 

75 80 85 

TTT CTC GGG GAC GAC CTG ACG GCC GAG CTG GAC CTC ACC GCC CTT CGC 2922 
Phe Val Gly Asp Glu Val Thr Ala Glu Val Glu Val Thr Ala Leu Arq 

90 95 100 

G^G GAC AAG CCC ATC GCC ACC CTG ACC ACC CGC ATC TTC ACC CAA GGC 2970 
Glu Asp Lys Pro He Ala Thr Leu Thr Thr Arq He Phe Thr Gin Gly 
1D5 XLO XLS 120 

GGC GCC CTC GCC CTG ACG GGG GAA GCC CTG CTC AAC CTG CCT 3012 
Gly Ala Leu Ala Val Thr Gly Glu Ala Val Val Lys Leu Pro 

125 130 
TAAGCACCGG CGGCACGCAG GCACAATCAC CCOGGCCOCT OCCGGGCTGA TTCTTCTCCC 3072 

ccGcrccccr tgcccccttt ttcggggcaa tttggcccag ccccnnrccc tgccccgcct 3132 

AACTGCCTAA AATGOCCGCC CTGCCGTCTA GGCATTCATC CACCTAG\GG AATTC 3187 
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CTGGGATCCG CCGCTGCTTA AGGCACCTTG 
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^m^l 0-108 68 2 
46 

Ser Ala Cln Ser Leu Clu Val Gly Gin Lys Ala Arg Leu Ser Lys Arg 

1 5 10 15 

Phe Gly Ala Ala 
20 



iS9»J: 

Met Ser Ala Gin Ser Leu Glu Val Gly Gin Lys Ala Arg Leu Ser Lys 

15 10 15 

Arg Phe Gly Ala Ala 
20 
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